Dataset statistics
| Number of variables | 8 |
|---|---|
| Number of observations | 1519 |
| Missing cells | 4173 |
| Missing cells (%) | 34.3% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 95.1 KiB |
| Average record size in memory | 64.1 B |
Variable types
| DateTime | 1 |
|---|---|
| Numeric | 7 |
grocery_and_pharmacy is highly overall correlated with parks_change and 4 other fields | High correlation |
parks_change is highly overall correlated with grocery_and_pharmacy and 4 other fields | High correlation |
residential_change is highly overall correlated with grocery_and_pharmacy and 4 other fields | High correlation |
retail_and_recreation is highly overall correlated with grocery_and_pharmacy and 4 other fields | High correlation |
transit_stations is highly overall correlated with grocery_and_pharmacy and 4 other fields | High correlation |
workplaces is highly overall correlated with grocery_and_pharmacy and 4 other fields | High correlation |
daily_cases has 542 (35.7%) missing values | Missing |
retail_and_recreation has 545 (35.9%) missing values | Missing |
grocery_and_pharmacy has 545 (35.9%) missing values | Missing |
parks_change has 545 (35.9%) missing values | Missing |
transit_stations has 545 (35.9%) missing values | Missing |
workplaces has 906 (59.6%) missing values | Missing |
residential_change has 545 (35.9%) missing values | Missing |
Date has unique values | Unique |
daily_cases has 399 (26.3%) zeros | Zeros |
grocery_and_pharmacy has 35 (2.3%) zeros | Zeros |
residential_change has 24 (1.6%) zeros | Zeros |
Reproduction
| Analysis started | 2024-06-16 03:33:58.510651 |
|---|---|
| Analysis finished | 2024-06-16 03:34:08.053233 |
| Duration | 9.54 seconds |
| Software version | ydata-profiling v4.8.3 |
| Download configuration | config.json |
Date
Date
UNIQUE 
| Distinct | 1519 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 12.0 KiB |
| Minimum | 2020-02-15 00:00:00 |
|---|---|
| Maximum | 2024-04-12 00:00:00 |
daily_cases
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 357 |
|---|---|
| Distinct (%) | 36.5% |
| Missing | 542 |
| Missing (%) | 35.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1672.7247 |
| Minimum | 0 |
|---|---|
| Maximum | 225694 |
| Zeros | 399 |
| Zeros (%) | 26.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 12.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 4 |
| Q3 | 239 |
| 95-th percentile | 3985.8 |
| Maximum | 225694 |
| Range | 225694 |
| Interquartile range (IQR) | 239 |
Descriptive statistics
| Standard deviation | 12218.728 |
|---|---|
| Coefficient of variation (CV) | 7.3046855 |
| Kurtosis | 260.2371 |
| Mean | 1672.7247 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 15.419289 |
| Sum | 1634252 |
| Variance | 1.4929731 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 399 | |
| 1 | 49 | 3.2% |
| 2 | 27 | 1.8% |
| 3 | 12 | 0.8% |
| 4 | 9 | 0.6% |
| 6 | 9 | 0.6% |
| 5 | 7 | 0.5% |
| 8 | 7 | 0.5% |
| 9 | 6 | 0.4% |
| 14 | 5 | 0.3% |
| Other values (347) | 447 | |
| (Missing) | 542 |
| Value | Count | Frequency (%) |
| 0 | 399 | |
| 1 | 49 | 3.2% |
| 2 | 27 | 1.8% |
| 3 | 12 | 0.8% |
| 4 | 9 | 0.6% |
| 5 | 7 | 0.5% |
| 6 | 9 | 0.6% |
| 7 | 5 | 0.3% |
| 8 | 7 | 0.5% |
| 9 | 6 | 0.4% |
| Value | Count | Frequency (%) |
| 225694 | 1 | |
| 211072 | 1 | |
| 189328 | 1 | |
| 40937 | 1 | |
| 32650 | 1 | |
| 32317 | 1 | |
| 31899 | 1 | |
| 31380 | 1 | |
| 30157 | 1 | |
| 29835 | 1 |
retail_and_recreation
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 97 |
|---|---|
| Distinct (%) | 10.0% |
| Missing | 545 |
| Missing (%) | 35.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -25.291581 |
| Minimum | -87 |
|---|---|
| Maximum | 21 |
| Zeros | 13 |
| Zeros (%) | 0.9% |
| Negative | 901 |
| Negative (%) | 59.3% |
| Memory size | 12.0 KiB |
Quantile statistics
| Minimum | -87 |
|---|---|
| 5-th percentile | -79 |
| Q1 | -35.75 |
| median | -19.5 |
| Q3 | -9 |
| 95-th percentile | 2 |
| Maximum | 21 |
| Range | 108 |
| Interquartile range (IQR) | 26.75 |
Descriptive statistics
| Standard deviation | 22.534277 |
|---|---|
| Coefficient of variation (CV) | -0.89097935 |
| Kurtosis | 0.6320948 |
| Mean | -25.291581 |
| Median Absolute Deviation (MAD) | 12.5 |
| Skewness | -1.0586386 |
| Sum | -24634 |
| Variance | 507.79362 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -14 | 36 | 2.4% |
| -7 | 32 | 2.1% |
| -8 | 30 | 2.0% |
| -21 | 29 | 1.9% |
| -15 | 29 | 1.9% |
| -12 | 28 | 1.8% |
| -13 | 26 | 1.7% |
| -16 | 23 | 1.5% |
| -10 | 23 | 1.5% |
| -9 | 23 | 1.5% |
| Other values (87) | 695 | |
| (Missing) | 545 |
| Value | Count | Frequency (%) |
| -87 | 1 | 0.1% |
| -86 | 5 | 0.3% |
| -85 | 6 | 0.4% |
| -84 | 16 | |
| -83 | 5 | 0.3% |
| -82 | 5 | 0.3% |
| -81 | 7 | |
| -80 | 3 | 0.2% |
| -79 | 6 | 0.4% |
| -78 | 5 | 0.3% |
| Value | Count | Frequency (%) |
| 21 | 1 | 0.1% |
| 19 | 2 | 0.1% |
| 13 | 2 | 0.1% |
| 10 | 4 | |
| 9 | 2 | 0.1% |
| 8 | 4 | |
| 7 | 3 | |
| 6 | 6 | |
| 5 | 6 | |
| 4 | 6 |
grocery_and_pharmacy
Real number (ℝ)
HIGH CORRELATION  MISSING  ZEROS 
| Distinct | 87 |
|---|---|
| Distinct (%) | 8.9% |
| Missing | 545 |
| Missing (%) | 35.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -9.9507187 |
| Minimum | -67 |
|---|---|
| Maximum | 22 |
| Zeros | 35 |
| Zeros (%) | 2.3% |
| Negative | 634 |
| Negative (%) | 41.7% |
| Memory size | 12.0 KiB |
Quantile statistics
| Minimum | -67 |
|---|---|
| 5-th percentile | -57 |
| Q1 | -17 |
| median | -5 |
| Q3 | 2 |
| 95-th percentile | 12 |
| Maximum | 22 |
| Range | 89 |
| Interquartile range (IQR) | 19 |
Descriptive statistics
| Standard deviation | 18.894678 |
|---|---|
| Coefficient of variation (CV) | -1.8988255 |
| Kurtosis | 1.439318 |
| Mean | -9.9507187 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | -1.3313865 |
| Sum | -9692 |
| Variance | 357.00887 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 37 | 2.4% |
| -1 | 36 | 2.4% |
| 0 | 35 | 2.3% |
| 1 | 34 | 2.2% |
| -9 | 32 | 2.1% |
| -5 | 31 | 2.0% |
| 4 | 30 | 2.0% |
| -8 | 29 | 1.9% |
| -7 | 29 | 1.9% |
| -3 | 29 | 1.9% |
| Other values (77) | 652 | |
| (Missing) | 545 |
| Value | Count | Frequency (%) |
| -67 | 1 | 0.1% |
| -66 | 7 | |
| -65 | 3 | 0.2% |
| -64 | 7 | |
| -63 | 4 | 0.3% |
| -62 | 10 | |
| -61 | 6 | |
| -60 | 2 | 0.1% |
| -59 | 2 | 0.1% |
| -58 | 4 | 0.3% |
| Value | Count | Frequency (%) |
| 22 | 2 | 0.1% |
| 21 | 1 | 0.1% |
| 20 | 2 | 0.1% |
| 19 | 4 | |
| 18 | 3 | 0.2% |
| 17 | 4 | |
| 16 | 4 | |
| 15 | 8 | |
| 14 | 7 | |
| 13 | 7 |
parks_change
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 112 |
|---|---|
| Distinct (%) | 11.5% |
| Missing | 545 |
| Missing (%) | 35.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -22.379877 |
| Minimum | -83 |
|---|---|
| Maximum | 38 |
| Zeros | 15 |
| Zeros (%) | 1.0% |
| Negative | 846 |
| Negative (%) | 55.7% |
| Memory size | 12.0 KiB |
Quantile statistics
| Minimum | -83 |
|---|---|
| 5-th percentile | -72.35 |
| Q1 | -33 |
| median | -19 |
| Q3 | -8 |
| 95-th percentile | 12 |
| Maximum | 38 |
| Range | 121 |
| Interquartile range (IQR) | 25 |
Descriptive statistics
| Standard deviation | 22.425197 |
|---|---|
| Coefficient of variation (CV) | -1.002025 |
| Kurtosis | 0.51102019 |
| Mean | -22.379877 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | -0.61912227 |
| Sum | -21798 |
| Variance | 502.88946 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -17 | 30 | 2.0% |
| -9 | 29 | 1.9% |
| -31 | 27 | 1.8% |
| -15 | 26 | 1.7% |
| -14 | 26 | 1.7% |
| -30 | 24 | 1.6% |
| -18 | 24 | 1.6% |
| -10 | 23 | 1.5% |
| -5 | 22 | 1.4% |
| -16 | 21 | 1.4% |
| Other values (102) | 722 | |
| (Missing) | 545 |
| Value | Count | Frequency (%) |
| -83 | 1 | 0.1% |
| -82 | 1 | 0.1% |
| -81 | 2 | 0.1% |
| -80 | 11 | |
| -79 | 12 | |
| -78 | 7 | |
| -77 | 4 | 0.3% |
| -76 | 3 | 0.2% |
| -75 | 2 | 0.1% |
| -74 | 3 | 0.2% |
| Value | Count | Frequency (%) |
| 38 | 1 | 0.1% |
| 36 | 1 | 0.1% |
| 32 | 1 | 0.1% |
| 31 | 1 | 0.1% |
| 29 | 1 | 0.1% |
| 24 | 1 | 0.1% |
| 23 | 2 | |
| 22 | 4 | |
| 21 | 2 | |
| 20 | 1 | 0.1% |
transit_stations
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 115 |
|---|---|
| Distinct (%) | 11.8% |
| Missing | 545 |
| Missing (%) | 35.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -30.336756 |
| Minimum | -89 |
|---|---|
| Maximum | 56 |
| Zeros | 11 |
| Zeros (%) | 0.7% |
| Negative | 881 |
| Negative (%) | 58.0% |
| Memory size | 12.0 KiB |
Quantile statistics
| Minimum | -89 |
|---|---|
| 5-th percentile | -83 |
| Q1 | -52 |
| median | -24 |
| Q3 | -9 |
| 95-th percentile | 4 |
| Maximum | 56 |
| Range | 145 |
| Interquartile range (IQR) | 43 |
Descriptive statistics
| Standard deviation | 26.621583 |
|---|---|
| Coefficient of variation (CV) | -0.8775356 |
| Kurtosis | -0.63751836 |
| Mean | -30.336756 |
| Median Absolute Deviation (MAD) | 20 |
| Skewness | -0.36279596 |
| Sum | -29548 |
| Variance | 708.70868 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -18 | 28 | 1.8% |
| -7 | 26 | 1.7% |
| -55 | 25 | 1.6% |
| -5 | 23 | 1.5% |
| -3 | 22 | 1.4% |
| -4 | 22 | 1.4% |
| -10 | 21 | 1.4% |
| -6 | 20 | 1.3% |
| -12 | 20 | 1.3% |
| -20 | 20 | 1.3% |
| Other values (105) | 747 | |
| (Missing) | 545 |
| Value | Count | Frequency (%) |
| -89 | 4 | 0.3% |
| -88 | 5 | 0.3% |
| -87 | 13 | |
| -86 | 11 | |
| -85 | 4 | 0.3% |
| -84 | 7 | |
| -83 | 7 | |
| -82 | 1 | 0.1% |
| -81 | 1 | 0.1% |
| -80 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 56 | 1 | 0.1% |
| 53 | 1 | 0.1% |
| 26 | 1 | 0.1% |
| 25 | 1 | 0.1% |
| 22 | 1 | 0.1% |
| 20 | 1 | 0.1% |
| 19 | 1 | 0.1% |
| 18 | 1 | 0.1% |
| 17 | 3 | |
| 16 | 2 |
workplaces
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 101 |
|---|---|
| Distinct (%) | 16.5% |
| Missing | 906 |
| Missing (%) | 59.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -8.0261011 |
| Minimum | -84 |
|---|---|
| Maximum | 30 |
| Zeros | 12 |
| Zeros (%) | 0.8% |
| Negative | 339 |
| Negative (%) | 22.3% |
| Memory size | 12.0 KiB |
Quantile statistics
| Minimum | -84 |
|---|---|
| 5-th percentile | -62 |
| Q1 | -17 |
| median | -2 |
| Q3 | 10 |
| 95-th percentile | 24 |
| Maximum | 30 |
| Range | 114 |
| Interquartile range (IQR) | 27 |
Descriptive statistics
| Standard deviation | 25.346797 |
|---|---|
| Coefficient of variation (CV) | -3.158046 |
| Kurtosis | 0.17826171 |
| Mean | -8.0261011 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | -0.98266616 |
| Sum | -4920 |
| Variance | 642.4601 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -2 | 24 | 1.6% |
| -5 | 17 | 1.1% |
| -1 | 17 | 1.1% |
| -4 | 17 | 1.1% |
| 6 | 16 | 1.1% |
| -7 | 14 | 0.9% |
| -3 | 14 | 0.9% |
| 3 | 14 | 0.9% |
| -8 | 14 | 0.9% |
| 4 | 13 | 0.9% |
| Other values (91) | 453 | |
| (Missing) | 906 |
| Value | Count | Frequency (%) |
| -84 | 1 | |
| -82 | 1 | |
| -80 | 1 | |
| -76 | 1 | |
| -74 | 1 | |
| -73 | 1 | |
| -71 | 1 | |
| -70 | 2 | |
| -69 | 1 | |
| -67 | 1 |
| Value | Count | Frequency (%) |
| 30 | 2 | 0.1% |
| 29 | 3 | 0.2% |
| 28 | 6 | |
| 27 | 3 | 0.2% |
| 26 | 5 | |
| 25 | 5 | |
| 24 | 8 | |
| 23 | 5 | |
| 22 | 4 | |
| 21 | 8 |
residential_change
Real number (ℝ)
HIGH CORRELATION  MISSING  ZEROS 
| Distinct | 50 |
|---|---|
| Distinct (%) | 5.1% |
| Missing | 545 |
| Missing (%) | 35.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.3285421 |
| Minimum | -13 |
|---|---|
| Maximum | 43 |
| Zeros | 24 |
| Zeros (%) | 1.6% |
| Negative | 139 |
| Negative (%) | 9.2% |
| Memory size | 12.0 KiB |
Quantile statistics
| Minimum | -13 |
|---|---|
| 5-th percentile | -4 |
| Q1 | 3 |
| median | 6 |
| Q3 | 9 |
| 95-th percentile | 31 |
| Maximum | 43 |
| Range | 56 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 8.9660125 |
|---|---|
| Coefficient of variation (CV) | 1.2234374 |
| Kurtosis | 2.6782074 |
| Mean | 7.3285421 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 1.5249703 |
| Sum | 7138 |
| Variance | 80.38938 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6 | 101 | 6.6% |
| 7 | 91 | 6.0% |
| 5 | 88 | 5.8% |
| 4 | 76 | 5.0% |
| 8 | 51 | 3.4% |
| 3 | 44 | 2.9% |
| 2 | 43 | 2.8% |
| 9 | 42 | 2.8% |
| 10 | 41 | 2.7% |
| 1 | 37 | 2.4% |
| Other values (40) | 360 | |
| (Missing) | 545 |
| Value | Count | Frequency (%) |
| -13 | 1 | 0.1% |
| -8 | 5 | 0.3% |
| -7 | 5 | 0.3% |
| -6 | 7 | 0.5% |
| -5 | 21 | |
| -4 | 22 | |
| -3 | 26 | |
| -2 | 29 | |
| -1 | 23 | |
| 0 | 24 |
| Value | Count | Frequency (%) |
| 43 | 1 | 0.1% |
| 39 | 1 | 0.1% |
| 38 | 2 | 0.1% |
| 37 | 3 | 0.2% |
| 36 | 8 | |
| 35 | 9 | |
| 34 | 7 | |
| 33 | 3 | 0.2% |
| 32 | 8 | |
| 31 | 9 |
| daily_cases | grocery_and_pharmacy | parks_change | residential_change | retail_and_recreation | transit_stations | workplaces | |
|---|---|---|---|---|---|---|---|
| daily_cases | 1.000 | -0.193 | -0.016 | 0.313 | -0.223 | -0.427 | -0.406 |
| grocery_and_pharmacy | -0.193 | 1.000 | 0.795 | -0.657 | 0.757 | 0.745 | 0.625 |
| parks_change | -0.016 | 0.795 | 1.000 | -0.563 | 0.816 | 0.680 | 0.660 |
| residential_change | 0.313 | -0.657 | -0.563 | 1.000 | -0.740 | -0.795 | -0.751 |
| retail_and_recreation | -0.223 | 0.757 | 0.816 | -0.740 | 1.000 | 0.874 | 0.775 |
| transit_stations | -0.427 | 0.745 | 0.680 | -0.795 | 0.874 | 1.000 | 0.811 |
| workplaces | -0.406 | 0.625 | 0.660 | -0.751 | 0.775 | 0.811 | 1.000 |
| Date | daily_cases | retail_and_recreation | grocery_and_pharmacy | parks_change | transit_stations | workplaces | residential_change | |
|---|---|---|---|---|---|---|---|---|
| 0 | 2020-02-15 | NaN | -8.0 | -4.0 | -5.0 | -14.0 | -4.0 | 7.0 |
| 1 | 2020-02-16 | NaN | -15.0 | -9.0 | -11.0 | -20.0 | -5.0 | 8.0 |
| 2 | 2020-02-17 | NaN | -16.0 | -12.0 | -10.0 | -19.0 | 3.0 | 7.0 |
| 3 | 2020-02-18 | NaN | -14.0 | -9.0 | -9.0 | -18.0 | 10.0 | 8.0 |
| 4 | 2020-02-19 | NaN | -8.0 | -16.0 | -15.0 | -21.0 | 9.0 | 6.0 |
| 5 | 2020-02-20 | NaN | -2.0 | -18.0 | -17.0 | -5.0 | 6.0 | 4.0 |
| 6 | 2020-02-21 | NaN | -14.0 | -9.0 | -14.0 | -18.0 | -2.0 | 7.0 |
| 7 | 2020-02-22 | NaN | -8.0 | 0.0 | -3.0 | -13.0 | -2.0 | 7.0 |
| 8 | 2020-02-23 | NaN | -13.0 | -5.0 | 0.0 | -18.0 | -2.0 | 6.0 |
| 9 | 2020-02-24 | NaN | -14.0 | -12.0 | -5.0 | -18.0 | 4.0 | 6.0 |
| Date | daily_cases | retail_and_recreation | grocery_and_pharmacy | parks_change | transit_stations | workplaces | residential_change | |
|---|---|---|---|---|---|---|---|---|
| 1509 | 2024-04-03 | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 1510 | 2024-04-04 | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 1511 | 2024-04-05 | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 1512 | 2024-04-06 | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 1513 | 2024-04-07 | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 1514 | 2024-04-08 | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 1515 | 2024-04-09 | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 1516 | 2024-04-10 | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 1517 | 2024-04-11 | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 1518 | 2024-04-12 | NaN | NaN | NaN | NaN | NaN | NaN | NaN |